SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Bisgard-Frantzen, Henrik 
Svendsen, Allan 
Borchert, Torben Vedel

(ii) TITLE OF INVENTION: AMYLASE VARIANTS

(iii) NUMBER OF SEQUENCES: 32 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novo Nordisk of North America, Inc. 

(B) STREET: 405 Lexington Avenue, Suite 6400 

(C) CITY: New York 

(D) STATE: New York

(E) COUNTRY: U.S.A.

(F) ZIP: 10174-6401

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/600,656 

(B) FILING DATE: 13-FEB-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lambiris, Elias J.

(B) REGISTRATION NUMBER: 33,728 

(C) REFERENCE/ DOCKET NUMBER: 431B.204-US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212 867 0123 

(B) TELEFAX: 212 867 0298 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 485 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala
20 25 30

Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45






Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Asn Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
85 90 95

Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Gly Thr Glu Ile Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn
115 120 125

Gln Glu Thr Ser Gly Glu Tyr Ala Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Thr Asp Trp Asp Gln Ser Arg Gln Leu Gln Asn Lys
165 170 175

Ile Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met
195 200 205

Asp His Pro Glu Val Ile His Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr
245 250 255

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Ile Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

Gly Tyr Tyr Asp Met Arg Asn Ile Leu Asn Gly Ser Val Val Gln Lys
305 310 315 320

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Gly Glu Ala Leu Glu Ser Phe Val Gln Gln Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Val Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
370 375 380

Lys Ile Asp Pro Leu Leu Gln Ala Arg Gln Thr Phe Ala Tyr Gly Thr
385 390 395 400

Gln His Asp Tyr Phe Asp His His Asp Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly
435 440 445

Gln Val Trp Arg Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Val Trp Val Lys Gln
485

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp His
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser
20 25 30

Asn Leu Arg Asn Arg Gly Ile Thr Ala Ile Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Ser Gln Leu Glu Ser Ala Ile His Ala Leu Lys Asn Asn Gly
85 90 95

Val Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

Gln Glu Ile Ser Gly Asp Tyr Thr Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Gln Phe Gln Asn Arg
165 170 175

Ile Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met
195 200 205

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr
210 215 220

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala
245 250 255

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gln Lys
305 310 315 320

His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Gly Glu Ser Leu Glu Ser Phe Val Gln Glu Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Ile Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Ser Val Pro Ala Met Lys Ala
370 375 380

Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Asn Phe Ala Tyr Gly Thr
385 390 395 400

Gln His Asp Tyr Phe Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gln Asn Lys Ala Gly
435 440 445

Gln Val Trp His Asp Ile Thr Gly Asn Lys Pro Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Ile Trp Val Lys Arg
485
(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr Leu
1 5 10 15

Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn
20 25 30

Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys
35 40 45

Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp
50 55 60

Leu Gly Glu Phe Asn Gln Lys Gly Ala Val Arg Thr Lys Tyr Gly Thr
65 70 75 80

Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala Gly Met
85 90 95

Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly
100 105 110

Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gln
115 120 125

Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe Asp Phe
130 135 140

Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His
145 150 155 160

Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg Ile Tyr
165 170 175

Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu
180 185 190

Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His
195 200 205

Pro Glu Val Val Thr Glu Leu Lys Ser Trp Gly Lys Trp Tyr Val Asn
210 215 220

Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys
225 230 235 240

Phe Ser Phe Phe Pro Asp Trp Leu Ser Asp Val Arg Ser Gln Thr Gly
245 250 255

Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys
260 265 270

Leu His Asn Tyr Ile Met Lys Thr Asn Gly Thr Met Ser Leu Phe Asp
275 280 285

Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Thr
290 295 300

Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro
305 310 315 320

Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln
325 330 335

Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala
340 345 350

Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp
355 360 365

Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile
370 375 380

Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His
385 390 395 400

Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Val
405 410 415

Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val
435 440 445

Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser
450 455 460

Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp
465 470 475 480

Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Trp Ser Ile Thr Thr
485 490 495

Arg Pro Trp Thr Asp Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val
500 505 510

Ala Trp

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1455 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 120

GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC 180

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAATATGGA 240

ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGCAAATTGT AAATGCGGTA 360

GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 420

ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 480

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 540

AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT 600

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG 660

GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AGTGAAACAT 720

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 780

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 840

AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900

TCTAATAGCG GTGGTTATTA TGATATGAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960

CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 1020

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 1140

GCTATGAAAT CTAAAATAGA CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 1260

CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 1320

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 1380

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 1440

GTTTGGGTGA AGCAA 1455



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60

GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120

GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 180

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 240

ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 300

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360

GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 420

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 480

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 540

CGAGGTGATG GTAAGGCATG GGATTGGGAA GTAGATTCGG AAAATGGAAA TTATGATTAT 600

TTAATGTATG CAGATGTAGA TATGGATCAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 660

GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 720

ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 780

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 840

AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 900

TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 960

CATCCAATGC ATGCCGTAAC TTTTGTGGAT AATCACGATT CTCAACCTGG GGAATCATTA 1020

GAATCATTTG TACAAGAATG GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 1080

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 1140

GCAATGAAAG CCAAGATTGA TCCAATCTTA GAGGCGCGTC AAAATTTTGC ATATGGAACA 1200

CAACATGATT ATTTTGACCA TCATAATATA ATCGGATGGA CACGTGAAGG AAATACCACG 1260

CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATGGATG 1320

TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 1380

ACAGTTACGA TCAATGCAGA TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 1440

ATTTGGGTGA AACGA 1455



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1548 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCCGCACCGT TTAACGGCAC CATGATGCAG TATTTTGAAT GGTACTTGCC GGATGATGGC 60

ACGTTATGGA CCAAAGTGGC CAATGAAGCC AACAACTTAT CCAGCCTTGG CATCACCGCT 120

CTTTGGCTGC CGCCCGCTTA CAAAGGAACA AGCCGCAGCG ACGTAGGGTA CGGAGTATAC 180

GACTTGTATG ACCTCGGCGA ATTCAATCAA AAAGGGACCG TCCGCACAAA ATACGGAACA 240

AAAGCTCAAT ATCTTCAAGC CATTCAAGCC GCCCACGCCG CTGGAATGCA AGTGTACGCC 300

GATGTCGTGT TCGACCATAA AGGCGGCGCT GACGGCACGG AATGGGTGGA CGCCGTCGAA 360

GTCAATCCGT CCGACCGCAA CCAAGAAATC TCGGGCACCT ATCAAATCCA AGCATGGACG 420

AAATTTGATT TTCCCGGGCG CGGCAACACC TACTCCAGCT TTAAGTGGCG CTGGTACCAT 480

TTTGACGGCG TTGATTGGGA CGAAAGCCGA AAATTGAGCC GCATTTACAA ATTGCGCGGC 540

ATCGGCAAAG CGTGGGATTG GGAAGTAGAC ACGGAAAACG GAAACTATGA CTACTTAATG 600

TATGCCGACC TTGATATGGA TCATCCCGAA GTCGTGACCG AGCTGAAAAA CTGGGGGAAA 660

TGGTATGTCA ACACAACGAA CATTGATGGG TTCCGGCTTG ATGCCGTCAA GCATATTAAG 720

TTCAGTTTTT TTCCTCATTG GTTGTCGTAT GTGCGTTCTC AGACTGGCAA GCCGCTATTT 780

ACCGTCGGGG AATATTGGAG CTATGACATC AACAAGTTGC ACAATTACAT TACGAAAACA 840

GACGGAACGA TGTCTTTGTT TGATGCCCCG TTACACAACA AATTTTATAC CGCTTCCAAA 900

TCAGGGGGCG CATTTGATAT GCGCACGTTA ATGACCAATA CTCTCATGAA AGATCAACCG 960

ACATTGGCCG TCACCTTCGT TGATAATCAT GACACCGAAC CCGGCCAAGC GCTGCAGTCA 1020

TGGGTCGACC CATGGTTCAA ACCGTTGGCT TACGCCTTTA TTCTAACTCG GCAGGAAGGA 1080

TACCCGTGCG TCTTTTATGG TGACTATTAT GGCATTCCAC AATATAACAT TCCTTCGCTG 1140

AAAAGCAAAA TCGATCCGCT CCTCATCGCG CGCAGGGATT ATGCTTACGG AACGCAACAT 1200

GATTATCTTG ATCACTCCGA CATCATCGGG TGGACAAGGG AAGGGGGCAC TGAAAAACCA 1260

GGATCCGGAC TGGCCGCACT GATCACCGAT GGGCCGGGAG GAAGCAAATG GATGTACGTT 1320

GGCAAACAAC ACGCTGGAAA AGTGTTCTAT GACCTTACCG GCAACCGGAG TGACACCGTC 1380

ACCATCAACA GTGATGGATG GGGGGAATTC AAAGTCAATG GCGGTTCGGT TTCGGTTTGG 1440

GTTCCTAGAA AAACGACCGT TTCTACCATC GCTCGGCCGA TCACAACCCG ACCGTGGACT 1500

GGTGAATTCG TCCGTTGGAC CGAACCACGG TTGGTGGCAT GGCCTTGA 1548

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 485 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Asn Ser Asp Ala Ser
20 25 30

Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Ala Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Ser Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
85 90 95

Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

Gln Glu Val Thr Gly Glu Tyr Thr Ile Glu Ala Trp Thr Arg Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Thr His Ser Ser Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Arg Leu Asn Asn Arg
165 170 175

Ile Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Met
195 200 205

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Ile Asn His Val Arg Ser Ala
245 250 255

Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Ile Glu Asn Tyr Leu Gln Lys Thr Asn Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly
290 295 300

Gly Asn Tyr Asp Met Arg Asn Ile Phe Asn Gly Thr Val Val Gln Arg
305 310 315 320

His Pro Ser His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Thr Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Arg Ser
370 375 380

Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Lys Tyr Ala Tyr Gly Lys
385 390 395 400

Gln Asn Asp Tyr Leu Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Ala Gly Gly Ser Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly
435 440 445

Gln Val Trp Ser Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Ile Trp Val Asn Lys
485

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GCTGCGGTGA CCTCTTTAAA AAATAACGGC 

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCACCGCTAT TAGATGCATT GTAC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTACGTATG CAGACGTCGA TATGGATCAC CC 32

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GATCCATATC GACGTCTGCA TACGTAAGAT AGTC 34

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
TTASGGGCAA GGCCTGGGAC TGG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCCAGGCCTT GCCCSTAAAT TTATATATTT TGTTTTG 37 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGTTTCGGTT CGAAGGATTC ACTTCTACCG C 31 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGGTAGAAG TGAATCCTTC GAACCGAAAC CAG 33 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GGTACTATCG TAACAATGGC CGATTGCTGA CGCTGTTATT TGC 43 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
CTGTGACTGG TGAGTACTCA ACCAAGTC 28 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTACTTCCCA ATCCCAAGCT TTACCTCGGA ATTTG 35 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAAATTCCGA GGTAAAGCTT GGGATTGGGA AGTAG 35 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
TTGAACAACC GTTCCATTAA GAAG 24 

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT TGTCCTGAAT TTATATATTT TGTTTTGAAG 60

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CTCTGTATCG ACTTCCCAGT CCCAAGCTTT GCCTCCGAAT TTATATATTT TGTTTTGAAG 60 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ATGTGTAAGC CAATCGCGAG TAAAGCTAAA TTTTATATGT TTCACTGCAT C 51 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
GCACCAAGGT CATTTCGCCA GAATTCAGCC ACTG 34

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

TGTCAGAACC AACGCGTATG CACATGGTTT AAACCATTG 39 

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ACCACCTGGA CCATCGCTGC AGATGGTGGC AAGGCCTGAA TT 42 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GGCAAAAGTT TGACGTGCCT CGAGAAGAGG GTCTAT 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
TTCTCCCGCT TTATTCTGGC CAACATACAT CCATTT 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CCCAATCCCA AGCTTTACCA YCGAACTTGT AGATACG 

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
CCCAATCCCA AGCTTTATCT CSGAACTTGT AGATACG



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GATCCATATC GACGTCTGCA TACAGTAAAT AATC 



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GATCCATATC GACGTCTGCA TAAATTAAAT AATC 